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My topic concerns spatial displays, and a constraint that they do not place on the use of 
spatial instruments. Much of the work done in visual perception by psychologists and by com- 
puter scientists has concerned displays that show the motion of rigid objects. Typically, 1 
SSSS5& arc rigid, one can then proceed to understand how the constant shape of the 
obiect can be perceived (or computed) as it moves through space. Many have assumed that 
rigidity principle reigns in perception; that is, the visual system prefers to see things as n ^ 
Thercarenow ample reasons to believe, however, that a rigidity principle is not always Allowed. 
Hochberg (1986), for example, has outlined some of the conditions under whic a ngi j 
ought to be seen, but is not. Some of these concern elaborations of some of the demonstrati 
that Adelbert Ames provided us more than 35 years ago. 

There is another condition of interest with respect to rigidity and motion P^ption. 
is not only must we know about those situations in which rigidity ought to be perceived, but is 
not4”Lmutta, 0 „ about those conditions in which rigidity ought not to be percetved, but ts. 
SeTadtesTone of these conditions, with respect to cinema. But before drscusstng enema, I 
must first consider photography. 

When we look at photographs or representational paintings, our eye position is not usually 
fixed. A puzzle arises from this fact: Linear perspective is mathematically rorrect for only one 
station point, or point of regard, yet almost any position generally in front of apicture will do o 

obiect identity and layout within the picture to appear relatively undisturbed. Preservation of p - 

nomenal identity and^hape of objects in slanted pictures is fortunate. Without them the utility of 
pictures would be vanishingly small. Yet the efficacy of slanted pictures is unpredicted by linear 

perspective theory. 

This puzzle was first treated systematically by La Goumerie inl859 (seel^enne 1970). I 
call it La Goumerie’s paradox; Kubovy (1986) has called it the robustness of perepectwe. The 
paradox occurs in twoforms: The first concerns viewing pictures either ^de Bo* 

proper station point; the second and more dramatic concerns viewing pictures from the side. Both 

are shown in the top panels of figure 1. 

To consider either distortion one must reconstruct, as La Goumerie did, the geometry of 
pictur ed £ space) behind the picture plane. The premise for doing so is that the image 
plane is unmoving, but invisible, and that observers look through it into pictured space to ™ke 
sense out of whafis depicted. Invisibility is, in many cases, obviously a very strong, if not false, 
Gumption, but it yield" interesting results. Possible changes in vie^ig position ^ £ong ti* z 
axis orthogonal to the picture plane, and along the x or y axes, parallel to il Both generate alhne 
SSS in all xz planes of virtual space. Observer movement along x or y axes 


also generates perspective transformations of the image, but these will not be considered here 
(Cutting 1986a, 1986b). 

In the upper left panel, four points are projected onto the image plane as might be seen in a 
large photograph taken with a short lens. When the observer moves closer to the image, as in the 
upper middle panel, the projected points must stay in the same physical locations in the photo. 
Thus, the geometry of what lies behind must change. Notice that the distance between front and 
back pairs of points of this four-point object is compressed, a collapse of depth like that when 
looking through a telephoto lens. All changes in z axis location of the observer create compression 
or expansion of the object in virtual space. When an observer moves to the side, as seen in the 
upper right panel, points in virtual space must shift over, and do so by different amounts. Such 
shifts are due to affine shear. All viewpoints of a picture yield additive combinations of these two 
affine effects-compression (or expansion) and shear. 

Such effects are compounded when viewing a motion sequence, as shown in the lower 
panels of figure 1. In particular, an otherwise rigid object should appear to hinge and become 
nonrigid oyer the course of several frames for a viewer seated to the side. Theoretically, the 
problem this poses for the cinematic viewer is enormous— every viewer in a cineauditorium has an 
eye position different than the projector and camera position, and thus, by the rules of perspective, 
no moving object should ever appear rigid. This is, I claim, the fundamental problem of the per- ’ 
ception of film and television. 

Most explanations for the perception of pictures at a slant are in sympathy with Helmholtz. 
Pirenne (1970, p. 99), for example, suggested that "an unconscious intuitive process of psycho- 
logical compensation takes place, which restores the correct view when the picture is looked at 
from the wrong position." Pirenne's unconscious inference appears to unpack the deformations 
through some process akin to mental rotation (Shepard & Cooper, 1982). According to this view 
the mind detransforms the distortions in pictured space so that things may be seen properly, and 
although Pirenne didn't discuss film, it might hold equally for film seen from the front row, side 
aisle. The force of my presentation is to show that this view is not necessary in the perception of 
slanted cinema. But first consider how this account might proceed. 


Pirenne and others have suggested at least three sources of image surface information that 
might be used to "correct" slanted images-(l) the edges of the screen, which yield a trapezoidal 
frame of reference; (2) binocular disparities, which grade across the slanted surface; and (3) pro- 
jection surface information such as texture and specularities. Since I am interested in none of 
these, I removed them from my displays through a double projection scheme, as shown in figure 
2. If one considers the situation of viewing slanted cinema, one has the real, slanted surface and 
one can measure a cross section of that optic array from it. This would be an imaginary projection 
surface. Once considered this way, one can reverse the two, placing the real surface in front of the 
imaginary, and this is what I did. 


In this manner, although the display frame was always rectangular for the observer, the 
rfiapes of rotating stimuli were like those seen from the side, with the right-edge elements in each 
frame longer than the left-edge ones and with the z axis compressed. This simulation yields a per- 
spective transformation of the image screen, and a nonperspective transformation of the stimulus 
behind it in virtual space. I presented viewers with computer-generated, rotating, rectangular 
1987 S ) relevant to this discussion. (For a more complete analysis see Cutting, 
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for their separate discussion). 

Second stimuli were presented with cinematic viewpoint varied; in Experiment 1, half 
were pro^aXewed <L .he conec. station pete. . h*« ^ 

compoundsthe nonrigid deformations of .he stim- 
ulus in pictorial space with an additional perspective transformation of the image. 

Viewers looked at many different tokens of all stimuli, and used a bipolar graded scale of 
rigidity Ind confidence, from 1 to 9-with 1 indicating high confidence in nonngidity, 9 high 
confidence in rigidity, and 5 indicating no confidence either way. 

Figure 3 shows *e resets of, tht 

oT^reen. tmd’Lohd, stimuli were 

seen as equally nonrigid regardless of simulated viewpoint. 

The lack of difference in the slanted and unslanted simulated viewing conditions is saildng, 

bu, ’. could be due .0 * a < x^^^inff^luced. Its 

duccd sl third viewing condition, a. steeper angi • . o _u c^vrpral neonle 

imnetus came from structure-from-motion algorithms in machine vision research. Several P<*>p 

suggested to me that screen slant could be another parameter in ngidity-fmdmg ^gon*ms and 

both rigidity and a dynamically changing projection surface. 

This time stimuli were generated in near-parallel and polar perspective. Again, stimuli 
could be rigidor nonrigid. Selected results for the nonrigid stimuli are shown in i figure 4 and 

. rr Fir<it the variable 67° screen slant condition was not different from the 

show two ^mgeffects ““ seem to embarrassing for any structure- 

Ihrough-nK>tion approach to die perception of these stimuli that includes screen dm as ^ able 
to be solved for. Second, if simulated screen slant is great enough, all stimuli begin to look 

nonrigid. 

A more interesting result is an interaction concerning near-parallel and polar projected stim- 
uli as shown IrigTre^ “lith the two 67° conditions collapsed, and all rigid and normgid l mak 
collapsed The near-parallel projected stimuli show no difference in perceived rigidity ' y 

angle P that they are viewed; the more polar projected stimuli, on the other hand, show a sharp 
decrease in perceived nonrigidity as the angle or regard increases. 
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T^ 1S ^ a ^ er e ^ ec ^ adds substance to other results in the literature. For example, Hagen and 
Elliott (1976) found what they called a "zoom effect" — the general preference for static stimuli seen 
is more parallel than polar projection. Here, in cinematic displays, stimuli that are near-parallel- 
projected are seen as more rigid from more places in a cineauditorium. 

In conclusion, let us be reminded that photographs and cinema are visual displays that are 
also powerful forms of art. Their efficacy, in part, stems from the fact that, although viewpoint is 
constrained when composing them, it is not nearly so constrained when viewing them. The reason 
that viewpoint is relatively unconstrained, I claim, is not that viewers "take into account" the slant 
of the screen, but that the visual system does not seem to compute the relatively small distortions in 
the projections, at least for certain stimuli that are projected in a near-parallel fashion. 

It is obvious that our visual system did not evolve to watch movies or look at photographs. 
Thus, what photographs and movies present to us must be allowed in the rule-governed system 
under which vision evolved. Slanted photographs and cinema present an interesting case where 
the rules are systematically broken, but broken in a way that is largely inconsequential to vision. 
Machine-vision algorithms, to be applicable to human vision, should show the same types of 
tolerances. 

But with regard to the use of camera lens in movies, it becomes quite clear why long 
lenses— those that are telephoto and nearly telephoto— are so popular and useful. First, and known 
for nearly a century, standard lenses tend to make people look like they have bulbous noses. Sec- 
ond, and corroborated by my results, long lenses provide a more nearly parallel projection of 
objects, and the distortions seen in these objects when a viewer looks at a slanted screen are 
significantly diminished. This enhances their efficacy considerably, despite the fact that it 
introduces the nonnatural situation of collapsing the apparent depth of a scene. 
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Figure L Reconstructive geometry and images. The upper panels show the reconstruction of four 
pillars in depth. Consider the left-most panel a representation of the real depth relations 
projected onto the image plane. If that plane is now a photograph, the pillars are fixed in 
position on the image plane. Thus, when an observer moves toward the plane, depth must 
be compressed, as in the upper middle panel. When the viewer moves to the side, all 
pillars slide over by differing amounts. The bottom panels show reconstructions of a 
moving square across three frames, from two viewpoints. Notice that the reconstruction 
for Observer 1 is rigid, but that that for Observer 2 is not (from Cutting, 1986a). 
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Figure 2. Arrangements of real and simulated projection surfaces that can remove image informa- 
tion from objects projected onto slanted screens (from Cutting, 1987). 



Figure 3. Selected results from Experiment 1. 90° and 67° are the two viewing conditions of inter- 
^ est, where 67° is the simulated screen slant as indicated in figure 2. R = rigid stimuli, 

N = nonrigid stimuli. 
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Figure 4. Selected results from Experiment 2. The added conditions are simulated screen slants of 
45°, and one of variable slant (between 80° and 55°), averaging 67°. R = rigid stimuli, 

N = nonrigid stimuli. 



Figure 5. Another description of the results of Experiment 2, parsed according to projection. 
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